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As the rising of electricity demand, electricity load profile characterization 
(ELPC) is the integral aspect in planning, operating system, and distribution 
network development. The approach in the existing ELPC is still relatively 
macro in nature and does not involve other aspects outside the electricity 
variable, so the results tend to be biased for areas experiencing rapid land use 


changes. Therefore, this paper proposes an ELPC approach based on micro- 


spatial. Microspatial analysis is done by dividing area in the form of the 
Keywords: smallest grids involving various electrical, demographic, geographic and 
socio-economic variables, which are then grouped using adaptive clustering 
Se oe based on fuzzy C-means (FCM). The adaptive clustering algorithm is proven 
Descriptive statistic to be able to determine the degree of membership of each grid data against 
Fuzzy C-means each cluster with the ability to determine the number of clusters automatically 
Grid according to the attribute data provided. The ELPC results which consist of 5 
Load profile clusters are then analyzed using descriptive statistic, plotted, and mapped to 
obtain more accurate and realistic load characteristics in accordance with the 
pattern and geographical conditions of the region, so that the results can be 
used as a reference in load forecasting, network development, and distributed 
generation (DG) integration. 
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1. INTRODUCTION 

One key factor that determine the quality of electrical energy is the existence of the distribution system 
planning that meets applicable technical standards and criteria. That feasible plan will be able to provide 
optimum service flexibility, so it can anticipate the increased electrical energy needs and the served load 
density. The distribution system planning must also be in accordance with the load characteristic pattern [1], 
so that the location and capacity of the distribution substations that will serve the load area can be determined 
in accordance with the regional profile. 

Commonly, electrical load forecasting is used as one of the references in distribution system planning. 
Usually, many loads forecasting done has not considered to use load profile characterization (LPC) as its 
integral reference to get more accurate prediction result. Whereas the more precise load forecast can result in 
the better quality of network development planning, the better design of energy conservation, the increasing 
efficiency of the power system due to the high quality of load management strategies, precise customer 
classification, as well as precise marketing policies and electricity tariff [2], [3]. However, the usual LPC result 
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is based on macro and sectoral, making it difficult to map and determine the existence of load centers [4]. This 
will be even worse for areas that has rapid and dynamic land use changes which is usually in line with the 
increasing of electricity load as the result of both economic and population growth [5]. 

Therefore, this research provides a novel approach in determining load profile characterization by 
performing a micro-spatial-based adaptive fuzzy (FCM) clustering technique to identify the pattern of load 
needs, types, and characteristics of the load C-means served by a smaller area (micro-spatial) [6]. The clustering 
method is needed so the physical areas can be grouped into some classes that consist of another similar 
characteristic areas [7], [8]. Fuzzy C-means clustering is one of the algorithms of clustering besides another 
algorithms which are hierarchical, K-means, and dynamic [9]. The well-known clustering method is K-means 
which basically uses iterative method for clustering [8], [10], [11] then the result can be used to analyze the 
power consumption behaviour [12]. If the clustering is done by considering each data as a separate cluster and 
then merging the similar ones, it is called hierarchical clustering as done in some literatures [10], [13], [14]. 
Fuzzy clustering or fuzzy C-means algorithm is used in this research because it can eliminate the clustering 
inaccuracy because of overlapping clusters [15]-[17]. The cluster results then is analyzed for its load profile 
characterization. The statictical method based on time series data can be implemented to get the statistic 
descriptive [12]. Another approach of load profile characterization is by using the frequency domain analysis 
[18], but the accuracy depends on the data sampling frequency. 

The area studied in this research is a sub-district area of 120 sub-districts (grid). The low pin count 
(LPC) carried out considers data that consist of 12 variables, including electrical and non-electrical variables, 
from each sub-district. With so many grids and variables, the clustering method based on unsupervised learning 
using adaptive clustering algorithm is suitable to be applied because it does not require data labeling. Clustering 
is a grouping method based on unsupervised learning where the algorithm tries to find a certain structure or 
pattern in a data set, where the objects in each cluster show a certain degree of similarity [19]. Clustering 
method is widely implemented to classify some areas then identify electrical load profile characterization [6], 
[20]. Commonly that methods have 2 types: hard clustering and soft clustering. If a number of data are divided 
into different clusters where each data element is only a member of | cluster then the method is referred as 
hard clustering [21]. On the other hand if each data element is assigned a degree of membership for each cluster 
then the method is referred as soft clustering [22]. Soft clustering is more suitable to be applied because it 
represents the state of the data and the relationship between the variables. However, the analysis of the regional 
(spatial) electrical load profile that has been carried out by [23] only in which the clustering was done by using 
neural network. 

Therefore, in order to obtain the most accurate possible load profile grouping, the clustering method 
used in this study is soft clustering which is implemented in the adaptive clustering algorithm. In this case of 
LPC clustering, the algorithm applied for the grouping is adaptive clustering. The algorithm is proven to be 
able to determine the degree of membership of each grid data against each cluster with the ability to determine 
the number of clusters or the number of groups that have the best performance automatically according to the 
attribute data provided. 

This is very different from the conventional clustering algorithm where the clustering is done 
randomly or sequentially from the lowest number of clusters and then its performance is evaluated manually 
using the Silhouette cluster validation method. In addition, the LPC approach with this method is able to display 
and identify load characteristics based on geographic, demographic and economic structures for a smaller area 
by analytic-descriptive methods in each cluster, so that information relating to how large and where the 
electrical load centre is located can be mapped more accurately to be used in obtaining an optimal service 
pattern. 


2. METHOD 
2.1. Microspatial analysis 

Micro-spatial analysis is an area analysis based on land use simulation modeling, which begins by 
dividing the service area and electricity utility into a set of small areas [3], [6]. The size of the grid varies 
depending on the availability of data and the variables used. Information on the pattern of land use development 
is then translated into that the results will be more accurate. Based on Figure 1 we can can see details in pattern 
of development of load requirements, so that LPC can be determined in a small area. In this research, the 
division of areas in the form of grids is based on the existing condition of the area, up to the sub-district level. 
The variables used include the electricity variable that is the peak load per sector and also non-electricity 
variables which include geography, demography, and socioeconomics 
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Figure 1. The region division in micro-spatial area 


2.2. Soft clustering 
Soft clustering is more suitable to be applied because it represents the state of the data and the 
relationship between the variables. One common method of soft clustering is fuzzy C-means (FCM). The FCM 
algorithm is [11]: 
- Input the data as data X, in the form of a matrix of size n x p (n=number of data samples, p=attributes of 
each data). 


Xj = data sample k-th (k = 1,2, ... , n), data attribute j-th (j = 1,2,3,.., m). 


- Determine the number of clusters, weighting power, maximum iteration, expected error, and objective 
function and initial iteration. 


7 Generate random number (wiz, i= 1,2,..,c; k= 1,2, ... , n), as elements of the initial partition matrix U. 
Hii(%1)  Miz(%2) Mic (Xe) 
Up= : : : (1) 
Hyi(%1)  Mi2(%2) Mic (Xe) 


The partition matrix in fuzzy clustering must meet the following conditions: 
Hix =[0,1]; d<i<c1<k<n) (2) 
ea Mix = 11isisc 
0< Midi <cjlsk<n 
Count the number of each column (attribute): 
Qj = Lia Hix (3) 


with j = 1,2,3,..,m 
then calculate the degree of membership of each data. 


Mik = om (4) 


- Calculate the cluster centroid of the k-cluster: Vj, where i = /,2,3,....c andj = 1,2,3,...,m 


Ser (Hin) XK) 


Vi,= 
U Vha1Hin)™ () 
Vy10 "Vim 
veal: : , 
Ver ***— Vem 


7 Calculate the objective function, P, 
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- Calculate the partition matrix change: 


-1 
Pa (Xig-Vig) Pt 
Hin = eset) PP (7) 
Yea [EP_ hey—-Vig) PP 


- Check stops condition: 
a. If ((Pt — Pt-1 |< &) or (t< max iteration) so it is done. 
b. If not, then f=rf+/ repeat to step 4. 


2.3. Cluster validation 

Cluster validation or clustering validation index (CVI) is done to see the quality and strength of the 
cluster [24]. Optimal cluster results can be influenced by the clustering method used, the characteristics of the 
dataset, the structure of the data density, the size of the data, the number of clusters used. Thus, it is important 
to observe the quality of the cluster using CVI. There are many methods for CVI like Silhouette index [25], 
[26], Davies-Bouldin score, Calinski-Harabasz score, Dunn’s index [27], [28], and many more. Some 
literatures combined two CVIs [25], [26]. 

In this paper, the validation method used is the silhouette coefficient or silhouette index. The silhouette 
coefficient method is a combination of two methods, namely the cohesion method which functions to measure 
the proximity of the data in one group and the separation method which functions to measure the closeness 
between the formed groups. When applied, the Silhouette algorithm will measure the average distance from an 
object to all objects in the same cluster with objects in other clusters [29]. The silhouette plot shows the 
silhouettes of all groups so that the quality of the groups can be compared based on the width (darkness) of the 
silhouettes. The equation in determining the silhouette coefficient is: 


., _ _ b(é)-a(i) 
sO = max(a(i),b(i) (8) 


where s (i) = value of silhouette coefficient 
(1) = average distance of i-data 


b (i) = average distance of i-data with all members 
The interpretation of silhouette value is determined with the Silhouette Width Index as shown in Table | [30]. 


Table 1. Silhouette width index 
Silhouette Coefficient _ Interpretation between objects and groups formed 


0.7<SC<1 Strong structure 
0.5 < SC < 0.7 Medium structure 
0.25 < SC < 0.5 Weak structure 
SC < 0.25 No relation 


2.4. Adaptive clustering 

The novelty in this research is the application of the adaptive clustering algorithm. Adaptive is an 
appropriate term because the algorithm can automatically generate data grouping with the most optimal number 
of clusters based on the evaluation of cluster results. This is very different from the conventional clustering 
algorithm where the clustering is done randomly or sequentially from the lowest number of clusters and then 
its performance is evaluated manually using the cluster validation method. It is obvious that the conventional 
clustering methods are ineffective and more time consuming. 

The clustering algorithm used in this adaptive clustering method is fuzzy C-means with group 
evaluation based on silhouette validation. The concept of adaptive clustering can be seen in Figure 2. The 
output generated from the adaptive clustering algorithm is the most optimal number of clusters, members of 
each cluster, and the average silhouette value. From the flowchart in Figure 2, the clustering process and its 
cluster evaluation, are done simultaneously to give the most optimal cluster number based on any data input. 
This is more effective than processing the clustering and its evaluation as separated algorithm. 
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Figure 2. Adaptive clustering flowchart 


2.5. Area mapping 

The area mapping process is carried out to help when the available geographic data is large in number 
and size and consists of many interrelated themes. This process is carried out after a cluster is formed which 
contains grids that are analyzed to connect various data at a certain point on the earth, combine them, analyze, 
and finally map the results. The data to be processed is spatial data, the results obtained are geographically 
oriented data and are locations that have a certain coordinate system, as the basis for reference. 

Spatial data and plotting carried out such as land zones and topography on maps will have data 
attributes that contain information that can be adapted to LPC. The spatial data can also be combined with other 
spatial data so that it becomes layers containing complementary data. The use of Geographic Information 
Systems requires technical standards such as map projection systems and types of layers in order to maintain 
the level of accuracy and is useful in facilitating the planning and development of distribution networks. 


2.6. Load Profile Characterization 

Determination of LPC using descriptive statistical method for each cluster. Each member (grid) in the 
cluster will have an identical LPC. It will be displayed and analyzed in accordance with the variables entered. 
Descriptive statistics is a statistical analysis process that focuses on the management, presentation, and 
classification of data. With this process, the data presented will be easier to understand, and able to provide 
more meaning for data users. 

The graphic representation of descriptive statistics method to determine the LPC is shown in Figure 
3, where Figure 3(a) is skewness coefficient criteria and Figure 3(b) is kurtosis coefficient criteria. Both 
coefficient criteria can give the information of data distribution and its dominant value. The size of the slope 
(skewness) refers to the departure of the distribution from symmetry [31]. If the slope value is known, then it 
is also known how the distribution model is, whether symmetrical, negative, or positive. To calculate the slope 
coefficient, the following Person's moment coefficient of skewness formula is used [32]. 


a = af ° 


With a3 = skewness (Slope Coefficient) 
= mean 
m = median 
o = deviation standard 
Criteria for the form of skewness are positive model if the skewness value is more than zero; negative 

model if skewness is less than zero; symmetrical model if the skewness value is equal to zero. The measure of 
curvature (kurtosis) is the degree of height of the peak of a frequency distribution, usually relative to the normal 
distribution [33]. To find out the type of sharpness of the graph, the kurtosis value can be calculated using the 
following formula: 


_ law 


o* 


K (10) 
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with K = kurtosis coefficient 

n = amount of data 

x = data value 
The criteria for the distribution model of kurtosis can be categorized into three, those are [34]: platykurtic 
distribution if the kurtosis coefficient < 0.263; mesokurtic distribution if the kurtosis coefficient = 0.263; and 
leptokurtic distribution if the kurtosis coefficient > 0.263. 


(a) Negatively skewed (b) Normal (no skew) (c) Positively skewed 
Mean 
Median 
Mode Mode Mode 


Frequency 


Median Median 
Mean Mean 
4 Xx 


x 
- - The normal curve - - - 
Negative direction represents a perfectly Positive direction 
r symmetrical distribution 
(a) 


Platykurtic distribution Normal distribution Leptokurtic distribution 
Low degree of peakedness Mesokurtic distribution High degree of peakedness 
Kurtosis <0 Kurtosis = 0 Kurtosis > 0 
(b) 


Figure 3. Graphic representation of descriptive statistics method of (a) skewness and (b) kurtosis coefficient 
criteria 


3. RESULTS AND DISCUSSION 
3.1. Data 

The area studied in this research is a sub-district area of 120 sub-districts (grid). The LPC carried out 
considers data that consist of 12 variables, including electrical and non-electrical variables, from each sub- 
district. The electrical variables are load for each sector (residential, industry, commercial, and social) and load 
density. Non-electrical variables are total household, percentage of gross domestic product (GDP) growth, land 
use of each sector, and total area of sectors. The following tables (Tables 2 to 5) shows the data that contains 
12 variables. Those are showed only 10 sub-districts out of 120 sub-districts data used in this research. 


3.2. Clustering proccess 

Before performing the clustering process, the data must be normalized. The normalization is critical 
step to group the attributes of various entities in a relation to form a good relationship structure (without 
redundancy or data repetition) and most of the ambiguity can be eliminated. Data plot comparison in Figure 4 
shows the significant effect of normalization process to the data. It is clear from the results in Figure 4(a), 
which shows the plot of actual data, that the raw data are not evenly distributed to a point, so there tends to be 
ambiguity. However, after normalizing the data which is shown in Figure 4(b), the normalized data distribution 
forms a simple, non-redundant entity, so that it can be ensured that the data is in good quality for the next step. 
Clustering using adaptive clustering based on FCM methods and silhouette algorithm to find the best number 
of clusters. This process needs data input which is normalized data, minimum and maximum number of 
clusters, maximum iteration, and maximum error. By using adaptive clustering, the clustering processes start 
grouping the data into minimum number of clusters until the maximum number of clusters that determined in 
the initial step. Then, the result of each execution of clustering algorithm is evaluated by using silhouette 
methods. The best silhouette index is chosen to get the best number of clusters. 
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Table 2. Demographic and economic parameters 
Sub-district (Grid) _ >}; Household — GDP growth (%) 


Kemuning 2379 11.38 
Solear 1559 12.08 
Rajeg 1610 8.08 
Kedoya Utara 8351 99.87 
Duri Kosambi 16228 208.98 
Cibadak 2132 5.76 
Bitung Jaya 2808 4.43 
Cipete 1933 15.06 
Batu ceper 4436 36.75 
Kamal Muara 2083 99.96 


Table 3. Geographical parameters 


er : Land Use (Ha) Area 
Sub-disinicy (and) Residential _ Industry Business _ Social (Ha) 
Kemuning 39.8 2598.3 1732.2 73.6 4444 
Solear 97.5 2737.9 1825.2 89.4 4750 
Rajeg 85.1 1818.5 1212.3 86.1 3202 
Kedoya Utara 266.9 2.8 4.1 50.2 324 
Duri Kosambi 238.4 101.4 82.7 80.4 502.9 
Cibadak 136.3 1255.8 837.2 122.8 2352 
Bitung Jaya 167.3 931.2 620.8 = 150.7 1870 
Cipete 166.4 0.0 0.0 50.8 217 
Batu ceper 89.9 15.7 36.7 8.8 151 
Kamal Muara 84.3 183.3 72.7 713.2 1053.4 


Table 4. Electrical parameters 


a . Load (kW) Load density (kW/Ha) 
pub serene) Residential Industry Business _ Social 
Kemuning 141.8  12639.8 3168.9 289.0 3.654 
Solear 347.0 13318.6 3339.1 350.9 3.654 
Rajeg 302.9 8846.4 2217.8 337.8 3.655 
Kedoya Utara 4286.1 277.2 288.1 1019.7 18.120 
Duri Kosambi 3827.4 9969.1 5884.6 1633.8 42.384 
Cibadak 485.0 6108.9 1531.5 481.8 3.660 
Bitung Jaya 595.4 4530.0 1135.7 591.5 3.665 
Cipete 592.4 0.0 0.0 199.3 3.648 
Batu ceper 320.0 76.4 67.1 34.7 3.299 
Kamal Muara 1353.1 18023.0 5170.3 14489.4 37.057 


This research determined to cluster the data from 2 to 10 number of clusters that was executed 20 
times to ensure the consistency of the results to cluster the data. Figure 5 shows the plot of silhouette index 
after 4 times validity running. The graph is limited to show only 4 times validity running so the trend of the 
silhouette index from each clustering validity can be presented clearly. Based on the clustering that has been 
carried out several times, the number of clusters that often appear and form is 4 and 5 number of clusters. While 
the best silhouette index is 0.70414 for 5 number of clusters. The total members of each cluster, both from 4 
and 5 number of clusters, can be seen in Table 5. 

Cluster 1, 3, and 4 have the same total members. Although it has the same total members, the sub- 
districts listed are not exactly the same because of different centroid. For example, in cluster 4 that contains 51 
sub-districts from both 4 and 5 number of clusters, there are 38 sub-districts listed in both, while the rest 13 
sub-districts are different. The centroid for each variable for 4 and 5 number of clusters is shown in Table 6. 


3.3. Cluster validation 

To ensure the optimum number of clusters formed that is 5 clusters, then validation using silhouette 
algorithm is carried out using (8), the results of cluster validation are shown in the Figure 6. It shows the 
silhouette index for each sub-district. The value shows the interpretation between sub-district and groups 
formed. The good value for silhouette coefficient is close to +1, in which it means the grid is far from other 
groups formed. If the value equals to 0, the gris is on or very close to the decision boundary grid If the value is 
close to +1. More negative value means the grid is overlapping, so the grid is more appropriate to be grouped 
in another cluster. From Figure 6 the average value of the silhouette coefficient is 0.704141, based on Table 1, 
the silhouette value which is above 0.7 the interpretation between the objects formed is very strong structure. 
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Figure 4. The data plot comparison between: (a) actual data and (b) normalized data 
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Figure 5. Best silhouette index from clustering validity processes 


Table 5. Total members of each cluster 


Number of Cluster Total member of k-th cluster 
2 3 4 5 

5 Clusters 13 19 51 1 

4 Clusters 14 19 51 0 
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Table 6. Centroid for 4 and 5 number of clusters 


‘ Centroid 

MaHSD IG 5 Clusters 4 Clusters 
> Household -0,343 -0,339 
Area 1,770 1,761 
Land Use: Residential 0,263 0,265 
Land Use: Industry 1,893 1,878 
Land Use: Business 1,872 1,857 
Land Use: Social 0,444 0,508 
GDP Growth -1,295 -1,277 
Load: Residential -1,045 -1,036 
Load: Industry 1,607 1,620 
Load: Business 0,908 0,921 
Load: Social -1,288 -1,112 
Load Density -0,837 -0,802 

Sithoutte Value of Optimum Cluster 


Cluster 


Silhouette Value 


Figure 6. Cluster validation 


3.4. Area mapping 

The clustering results obtained are then plotted and mapped to see the distribution, location of the area 
of each grid in each cluster, to make it easier to perform LFC from each area. The results of mapping the area 
can be seen in Figure 7. From the Table 5, there is a cluster that only has 1 member that is Kamal Muara sub- 
districts. The location of Kamal Muara can be seen in Figure 7. Only that sub-district that clustered 
distinctively. From a brief observation, if a grid tends to make their own cluster with only that grid as the cluster 
member, it means the attributes of that grid are far different from another. From the raw data, Kamal Muara 
sub-district has rather unique characteristics where the social sector land use and social sector load are high, 
no other areas have such that characteristics. Geographically, Kamal Muara sub-district is located in the coastal 
area with the mangrove forest dominated the area. 


3.5. Load profile characterization 

LPC is obtained after all areas are clustered using descriptive statistical method for each cluster. The 
descriptive statistics used are skewness and kurtosis analysis using SPSS software. Table 7 shows the result of 
descriptive statistics analysis of cluster 4. 

From Table 7, electrical variables can be analyzed based on the value of skewness and kurtosis for 
household (residential) sector load, industrial sector load, business sector load, and social sector loads. The 
load analysis can be represented as a graph of skewness and kurtosis as shown in Figure 8. Thus, the load 
profile characterization of each sector is: 
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- Residential sector: the skewness value is more than zero and the kurtosis value is more than 0.263, which 
means that the residential load of each customer tends to be low but the number of consumers for 
residential sector tend to be high. In addition, it can be seen based on the graph of kurtosis and Skewness 
in Figure 8(a). 

7 Industrial sector: the skewness value is less than zero and the kurtosis value is less than 0.263, this shows 
that the industrial load of each customer tends to be high with a relatively small number of customers. 
The skewness and kurtosis graphs for industrial loads are described as Figure 8(b). 

- Business sector: the skewness value is less than zero and the kurtosis is less than 0.263, this is the same 
as the industrial sector, where the business sector load of each customer tends to be high with a relatively 
small number of customers. The frequency distribution of business expenses according to the following 
Figure 8(c). 

- Social sector: as seen in Figure 8(d), the skewness value is more than zero and the kurtosis value is more 
than 0.263, which means that the social load of each customer but the number of consumers for social 
sector tend to be high. 

Therefore, the characteristics of the electrical load in cluster 4 are dominated by residential loads 
because the kurtosis value is the highest among the other three variables, which is 0.76, so that the residential 
load kurtosis curve is sharper than the industrial load, business load, and social load. The same method can be 
implemented in another clusters. By knowing the load profile characterization of every cluster, the distribution 
system planning will be more specific in the level of sub-district, so that the location and capacity of the 
distribution substations that will serve the load area can be determined in accordance with the regional profile. 
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Figure 7. Mapping result of 5 clusters and the position of Kamal Muara sub-district 


Table 7. Result of descriptive statistics analysis of cluster 4 
Descriptive statistics 


bits Mean Skewness _ Kurtosis 
> Household 13348.46 0.82 0.98 
Area 509.75 0.22 -0.14 
Land Use: Residential 318.68 0.84 0.76 
Land Use: Industry 54.86 -0.25 -1.23 
Land Use: Business 61.33 -0.15 -1.13 
Land Use: Social 74.82 0.70 0.57 
GDP Growth 184.88 0.15 -0.59 
Load: Residential 5116.73 0.84 0.76 
Load: Industry 5394.27 -0.25 -1.23 
Load: Business 4362.94 -0.15 -1.13 
Load: Social 1520.06 0.70 0.57 
Load Density 31.88 -0.07 -0.56 
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Figure 8. Load kurtosis and skewness graph of (a) residential sector, (b) industrial sector, 
(c) business sector, and (d) social sector 


4. CONCLUSION 

The area studied in this research is a sub-district area of 120 sub-districts (grid). The LPC carried out 
considers data that consist of 12 variables, including electrical and non-electrical variables, from each sub- 
district. The grids must be grouped into some clusters that contains the cluster members which have the similar 
profile, using adaptive clustering. The proposed clustering method used in this adaptive clustering algorithm 
is based on fuzzy C-means with group evaluation based on Silhouette validation. Based on the clustering that 
has been carried out several times, the number of clusters that often appear and form is 4 and 5 number of 
clusters. While the best silhouette index is 0.70414 for 5 number of clusters. Thus, the output of clustering 
process was analyzed for its electricity load profile characterization (LPC) using descriptive statistical method 
for each cluster. LPC has done for cluster 4 that contains 13 grids (sub-districts) which gives the result that the 
electrical load in cluster 4 are dominated by residential loads where the number of customers is high but the 
load for each residential customer tends to be low. The same method can be implemented in another clusters. 
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